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& ■ ABSTRACT 

^ | We present Hubble Space Telescope (HST) Advanced Camera for Surveys (ACS) and 

Wide Field Camera 3 (WFC3) observations of SDSS J1029+2623, a three-image quasar 
lens system produced by a foreground cluster at z = 0.584. Our strong lensing anal- 
ysis reveals 6 additional multiply imaged galaxies in addition to the multiply imaged 
quasar. We confirm the complex nature of the mass distribution of the lensing clus- 
ter, with a bimodal dark matter distribution which deviates from the Chandra X-ray 
surface brightness distribution. The Einstein radius of the lensing cluster is estimated 
■ to be #e = 15'.' 2 ± C/5 for the quasar redshift of z — 2.197. We derive a radial mass 

distribution from the combination of strong lensing, HST/ ACS weak lensing, and 
Subaru/Suprime-cam weak lensing analysis results, finding a best-fit virial mass of 
1 -ZWvir = 1.55^0 35 x 10 14 /j~ 1 Af Q and a concentration parameter of c v j r = 25.7^7 4 g 1 . 

The lensing mass estimate at the outer radius is smaller than the X-ray mass estimate 
by a factor of ~ 2. We ascribe this large mass discrepancy to shock heating of the 
intracluster gas during a merger, which is also suggested by the complex mass and gas 
distributions and the high value of the concentration parameter. In the HST image, 
we also identify a probable galaxy, GX, in the vicinity of the faintest quasar image C. 
In strong lens models, the inclusion of GX explains the anomalous flux ratios between 
the quasar images. The morphology of the highly elongated quasar host galaxy is also 
well reproduced. The best-fit model suggests large total magnifications of 30 for the 
quasar and 35 for the quasar host galaxy, and has an AB time delay consistent with 
the measured value. 

Key words: dark matter — galaxies: clusters: individual: SDSS J1029+2623 — 
gravitational lensing: strong — gravitational lensing: weak — quasars: individual: 
SDSS J1029+2623 — X-rays: galaxies: clusters 
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1 INTRODUCTION z s = 2.197 quasar produced by a foreground cluster of 

„„„„ , rz : n i 1 nr ■ n I k galaxies at zi = 0.584. Such "naked cusp" configurations 

SDSS J1029+2623 (Inad a et all 12006: Ogu ri et all f2008) 6 ,., , \ . . .„ . . * .. , . , 

, — , r — , are predicted to comprise a significant traction ol cluster- 

is a unique quasar lens system with three images of the sca i e quasar i enses Jpguri fc Keetonl liiol iLTetall [20071: 

M inor fc Kaplinghat1l2008l') . All thr ee quasar images are also 

* „ ., detected at radio ( Kratzer et alJ |2011T> and X-ray wave- 
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lengths <|Ota et al.|[20l3 ). and show mini-broad absorption 
line (mini-BAL) features in the Lya, SilV, and CIV emission 
lines. 

These earlier observations already implied that the lens- 
ing cluster is likely complex. The optical images clearly show 
two bright central galaxies in the lensing cluster. Follow-up 
spectroscopy found that the two galaxies have a velocity 
difference of At; ~ 2800 kms" 1 (|Oguri et alj|2008h . which 
might imply a high-speed collision of two massive haloes. 
Chandra X-ray observations show that the X-ray surface 
mass distribution is centred on one of the central galax- 
ies, with an additional X-ray subpeak ~ 9" Nor th- West of 
the m ain peak that has no optical counterpart (|Ota et alj 
2012). This complex X-ray morphology also suggests that 
the lensing cluster is in a merger. It is crucial to construct a 
reliable mass model from a lensing analysis in order to draw 
any conclusion on the dynamical state of the cluster. 

Another interesting feature of this system is the anoma- 
lous flux ratios seen in the quasar images. Simple mass mod- 
els predict the relative brightnesses of the three quasar im- 
ages A, B, and C should be B w C > A, whereas opti- 
cal, radio, and X-ray observations all found relative bright- 
nesses of B fs A > C . The fa ct that this "anomaly" is seen 
in r adio (|Kratzer et al.ll201lT ) and absorption-corrected X- 
ray (|Ota et alj 20121 ) flux ratios indicates that it is not due 
to extinction by any intervening matter. In addition, the 
rough agreement of the flux ratios between three different 
wavelengths implies th at it is not entirely due to stellar mi- 
crolensing (but see also lMotta et al.ll2012l ). particularly since 
microlensing generally leads to larg e differences betw een op- 
tical and X-ray flux ratios (e.g., IDai et aT] l2010al ). Thus 
the most plausible explanation for the anoma lous flux ratio 
is mi llilensing by (dark halo) substructures l|Kratzer et al.l 
l201ll ). An accurate macro lens model is crucial for breaking 
the degeneracy between the macro model uncertainties and 
the amount of substructure required to account for the flux 
ratios. 

In this paper, we present a new lensing analysis of 
SDSS J1029+2623 utilizing deep multiband Hubble Space 
Telescope (HST) observations of this system. Several mul- 
tiply imaged galaxies, which are newly identified from the 
HST images, are used to construct an accurate mass model 
of the core of the lensing cluster. Strong lensing results are 
then combined with weak lensing results from the HST im- 
ages and our earlier wide-field weak lensing analysis b ased 
on the Subaru/Suprime-cam images (|Qguri et alj|20l3 ). We 
then use the mass mode l to interpret the Chandra X-ray sur- 
face brightness profiles (|Ota et al.ll2012l ) and the anomalous 
flux ratios between the quasar images. 

The structure of this paper is as follows. After briefly de- 
scribing the observations in Section [2] we present the strong 
lensing analysis in Section [3] and the weak lensing analysis 
in Section 0] We then combine the various observations in 
Section [5] to discuss the properties of the lensing cluster. 
Section [6] is devoted to discussions of the anomalous flux 
ratio between the quasar images. Finally we summarize the 
results on Section]?] Throughout this paper we assume a flat 
cosmological model with matter density Qm = 0.275, cos- 
mological constant Qa = 0.725, and di mensionless Hubble 
constant h — 0.702 l|Komatsu et al.|[201ll ). All photometry is 
in the AB system without correcting for Galactic extinction. 
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Figure 1. The colour-magnitude diagram from the HST ACS im- 
ages. Here the magnitudes are 1" diameter aperture magnitudes. 
The dotted line indicates the colour cut used to eliminate cluster 
member galaxies for weak lensing analysis presented in Section [4] 



2 OBSERVATIONS 

We observed SDSS J1029+2623 with the Advanced Camera 
for Surveys (ACS) and the Wide Field Camera 3 (WFC3) 
on HST (GO-12195; PI Oguri). The observation consisted 
of 2 orbits (total exposure 5276 sec) of ACS/F475W imag- 
ing on 2011 May 15, 3 orbits (total exposure 8010 sec) of 
ACS/F814W imaging on 2011 April 17, and 2 orbits (to- 
tal exposure 5223.5 sec) of WFC3/F160W imaging on 2011 
April 15. We mostly follow the data reduction procedure de- 
scribed in ISchrabback et alj (j20ld ). which uses MultiDriz- 
zle (|Koekemoer et alj|2002h for cosmic ray removal, distor- 
tion removal and stacking. A pixel scale of ty/05 is adopted 
for both the ACS and WFC3 images. The ACS (WFC3) 
frames are drizzled using the lanczos3 (square) kernel, with 
a pixfrac of 1 (0.8). The ACS data are preprocessed using 
the pipeline CALACS, which also removes bias striping and 
provides a pixel-level char ge transfer inefficiency co rrection. 

We use SExtractor l|Bertin fc Arnoutsl Il996l ) to con- 
struct a photometric catalogue of the galaxies in the HST 
images. For each galaxy, we derive a total magnitude 
from the MAG_AUT0 parameter and a 1" diameter aper- 
ture magnitude. We show the colour-magnitude diagram 
of the SDSS J1029+2623 field from the two ACS images 
in Figure [T] The Figure shows a clear red-sequence of pas- 
sively evolving galaxies in the lensing cluster at mp475w — 

?TlF814W ~ 3. 

We also use wide-field Subaru Suprime-cam 
l|Mivazaki et al l |2002| ) images of SDSS J1029+2623 
for the analysis. These images consist of 1200 sec g-band, 
2700 sec r-band, and 1920 sec i-band images, with a seeing 
FWHM of 0'.'65 in r-band. T he weak lensi n g ana lysis of the 
Subaru data was presented in lOeuri et al.l (|2012l ). which we 
adopt in this paper. 

SDSS J1029+2623 was also observed with Chandra X- 
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Table 1. List of multiple images for strong lensing analysis. 



ID 


R.A. (J2000) a 


Decl. (J2000) a 


redshift 6 


A 


157.308139 


26.388441 


2.197 


B 


157.309398 


26.394600 




C 


157.309612 


26.394125 




1.1 


157.298104 


26.390875 


[1.93-2.36] 


1.2 


157.297918 


26.392583 




1.3 


157.300902 


26.397536 




2.1 


157.298228 


26.391494 


[1.84-2.27] 


2.2 


157.298197 


26.391783 




2.3 


157.301516 


26.397847 




3.1 


157.299103 


26.392536 


[2.62-3.24] 


.3.2 


157.307476 


26.391480 




3.3 


157.304173 


26.398397 




3.4 


157.301584 


26.388161 




3.5 


157.301770 


26.388063 




3.6 


157.301878 


26.388022 




3.7 


157.303243 


26.392097 




4.1 


157.299258 


26.392661 


[2.58-3.26] 


4.2 


157.307677 


26.391452 




4.3 


1 57 304390 


26 398286 




4.4 


157.302359 


26.387869 




4.5 


157.303531 


26.392152 




5.1 


157.302080 


26.394772 


[0.89-1.02] 


5.2 


157.300917 


26.392008 




5.3 


157.300964 


26.391730 




6.1 


157.309073 


26.395119 


2.197 


6.2 


157.309956 


26.392452 




6.3 


157.308576 


26.388813 





a See Figures [2] and \3\ for the configuration of the multiple image 
systems and the morphology of each image. 

b Redshift ranges in brackets are 2<r redshift ranges predicted by 
the strong lens models. 

ray observatory (ObsID 11755; PI Oguri) on 2010 March 11 
for 60 ksec. The X-ray image clearly reveals X-ray emission 
both from the lensing cluster and the three quasar images. 
The data reduc tion and basic sp ectral and image analysis 
were detailed in lOta et all (|2012T ). 



3 STRONG LENSING ANALYSIS 
3.1 Modelling method 

The main objective of the HST imaging observation was to 
constrain the central mass distribution of the lensing cluster 
using strong lensing information. For this purpose, we iden- 
tify multiply imaged galaxies in addition to the three quasar 
images based on their colours and morphologies combined 
with matching multiple image candidates while iteratively 
refining the mass models (see below for details of our mass 
model). In total, we identified 5 additional sets of multiply 
imaged galaxies (ID 1-5), which are shown in Figures [5] and 
[3] and summarized in Table [1] In addition, the HST image 
shown in Figure [2] exhibits a prominent host galaxy of the 
lensed quasar which is highly elongated due to lensing. In 
order to take account of the shape of the lensed host galaxy, 
we include an additional set of multiple images (ID 6) which 
roughly correspond to the edge of the host galaxy. To sum- 
marize, in this paper we use the positions of 27 multiple 
images of 7 systems as constraints. 



We para metrically m odel the lens using the public soft- 
ware glafic <|Ogurj|201(ih , although in the iterative pro- 
cess to identify the mult iple images we p artly used the 
public software lenstool (|jullo et alj [2007) as well. The 
mass model mainly consists of dark halo com p onent s mod- 
elled by elliptical iNavarroT Frenk. fc White] (|l997l . here- 
after NFW) profiles with the radial profile of p(r) oc 
r _1 (r + r s )~ 2 and member g alaxies modelled as ellipti- 
cal pseudo-Jaffe models (e.g., ICohn et alj [2001) with the 
radial profile of p(r) oc r~ 2 (r 2 + r 2 ^)^ 1 . Since the clus- 
ter core contains two bright galaxy concentrations, we in- 
clude two cluster-scale dark halo components centred at 
galaxy Gla (R.A. =157.305789, Decl -26.392602) and G2 
(R.A. =157.302083, Decl.=26.392344). Each dark halo com- 
ponent has 4 parameters: the ellipticity and position angle 
of their isodensity contours, the virial mass M v i r , and the 
concentration parameter c v i r - To reduce the number of pa- 
rameters, we assume that the velocity dispersion a and the 
cutoff radius r cut of the pseudo-Jaffe models scale with the 
luminosities of the galaxies as a oc L 1//4 and r cut oc L 1 ^ 2 , and 
regard the normalisations of the scaling relations as free pa- 
rameters. However, we do not apply these scaling relations 
for one member galaxy just North of images 3.4-3.6 because 
the locations of these images are very sensitive to the proper- 
ties of this galaxy. The velocity dispersion and cutoff radius 
of this particular galaxy are included as free parameters. The 
ellipticity and position angle of the pseudo-Jaffe model for 
each member galaxy are fixed to the value measured from 
the HST ACS/F814W image. In addition to the dark halo 
and member galaxy components, we include four perturba- 
tion terms (lens potential cA oc r 2 cos mS) with m = 2 (exter- 
nal shear), 3, 4, and 5 which effectively describe the asym- 
metries of cluster m ass distributions that are commonly seen 
in simulations (e.g., Menegh etti et all [20071. Since we have 
no spectroscopic redshift measurements for the multiply im- 
aged galaxies, the redshifts of the lensed galaxies are also 
treated as the free parameters. 

We use the positions of the 27 multiple images sum- 
marized in Table [T] as observational constraints. We assume 
positional errors of tf.'l for the quasar images and 0'.'4 for 
the galaxy images. These errors are larger than the measure- 
ment uncertainties, and are chosen to take account of any 
complexity of the lens potential that is not accounted for in 
the parametric mass model. For the quasar images, we in- 
clude flux ratios of A/B = 0.802 and C/B = 455 measured 
in th e Very Large Array (VLA) radio image l|Kratzer et al.l 
l201lf ). but assuming a conservative 20% error on the flux 
ratios. The assumed error is larger than the measurement 
errors (peak flux densities of 0.206, 0.257, 0.117 mjy/beam 
for images A, B, and C, with an error of 0.013 mjy/beam for 
each component) in order to take account of time variabili- 
ties as well as the complexity of the lens potential. Recently, 
we measured time dela y between quasar image s A and B as 
At A B = 744 ± 10 days (|Fohlmeister et alj|20ia ). We include 
this measurement as an observational constraint, although 
we assume a 5% error on AtAB, cr(AtAB) ~ 37 days, which 
is larger than the measurement error, because the time de- 
lay depends also on the assumed Hubble constant and is 
limited by cosmic var iance due to mat ter fluctuations along 
the line-of-sight (e.g.. lBar-Kana|[l996l ). In total, we have 57 
observational constraints and 39 model parameters (8 from 
the dark halo components, 4 from the member galaxy com- 
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Figure 2. Colour image of the SDSS J1029+2623 field from the HST ACS/F475W, ACS/F814W, and WFC3/F160W observations. 
North is up and West is to the right. The three quasar images are labelled by A, B, and C. Squares with ID numbers indicate the 
locations of identified multiple images (see also Table [TJ. Galaxies Gla, Gib, G2 are the central galaxies of the lensing cluster. Solid 
lines show the critical lines predicted by our best-fit mass model at z s = 2.197, the redshift of the strongly lensed quasar. 




Figure 3. Similar to Figure [2] but zooming in on the multiply imaged galaxies identified in this paper. 
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Figure 4. Contours of the best-fit surface mass density profile 
from strong lens modelling [thin blue) and the Chandra X-ray 
surface brightness profile (thick red) are overlaid on the HST 
ACS/F814W image. In the X-ray contours we removed the quasar 
images, but there are some subtraction residuals that distort the 
X-ray contours. 

ponents, 8 external perturbations, 5 source redshifts, and 
14 source positions). Thus, the models have 18 degrees of 
freedom. 



3.2 The best-fit model 

We derive the best-fit mass model by searching for the mini- 
mum x 2 - To speed-up the calculation, we estimate x 2 m the 
source plane, which has been shown to be reasonably accu- 
rate for mo st of strong lens modelling purposes (see, e.g., 
IOgurill2010l V We find a x 2 for the best-fit model of X 2 = 30, 
which is significantly larger than the number of degrees of 
freedom iVdof = 18. In fact, a large fraction of the x 2 comes 
from the constraints on the flux ratio. While our input flux 
ratios are A/B = 0.802 and C/B = 0.455, the best-fit model 
predicts A/B = 0.082 and C/B = 1.14. This results in 
a large x 2 contribution despite our conservative errors on 
the flux ratios, and indicates that the flux ratio anomaly 
found in previous simple mass models (|Oguri et alj 120081 ; 
IfCratzer et al.ll2~01ll ) persists even in this much more com- 
plex mass model. Once the flux ratios are removed from the 
observational constraints, the best-fit model yields x 2 = 10 
for iVdof = 18 — 2 = 16 degrees of freedom, suggesting that 
our choice of the positional errors on the image locations is 
reasonable. In particular, our best-fit model has a time delay 
of AtAB = 737 days that reproduces the observed AB time 
delay almost perfectly. We explore the flux ratio anomaly 
further in Section [5] 

The critical curves of the best-fit model are displayed 
in Figure [2] We find that contributions from both dark halo 
components to the critical curves are significant, and as a 
consequence, the centroid of the main critical curve respon- 
sible for the three quasar images falls between Gla and G2. 



In the best-fit model, the dark halo at G2 has a smaller 
virial mass but a larger concentration than the dark halo 
at Gla (see Table |2J. As a result, the halo of G2 con- 
tributes more mass inside the Einstein radius than the halo 
at Gla. Because the mass and concentration parameters 
constrained from strong lens modelling are quite degener- 
ate with each other, for a more robust comparison, we fix 
the concentration parameters to c v i r = 5 and find best-fit 
masses of M v \ r ~ 1.5 x 1O 14 /i _1 M0 for the dark halo at 
G2 and M vil ~ 1.0 x 1O 14 /i _1 M for the dark halo at Gla. 
Thus, the dark halo associated with G2 is more massive, 
although the effect of the dark halo at Gla is very signif- 
icant. We compare the surface mass densi ty of the best-fit 
model with the Chandra bright ness profile dOta et al.ll2012f l 
in Figure U As pointed out in lOta et all (|2012l ). the X-ray 
surface brightness profile of the main cluster component is 
centred at G2, with an additional subpeak North- West of the 
main peak, which appears to be different from the structure 
of the mass distribution found in the strong lens models. 
On the other hand, the position angles of the best-fit mass 
model, 9 e = 106.3 deg (measured East of North) for the dark 
halo at G2 and 105.7 deg for the dark halo at Gla, are in 
good agreement with the position angle of the X-ray surf ace 
brightness profile, 6 X = 110.6 ± 5.2 deg (|Ota et alj|20l3 ). 

An important quantity to characterise strong lens sys- 
tems is the Einstein radius Qe, although the definition of 
the Einstein radius is not unique, for such a complex lens 
system. In this paper, we define the Einstein radius as the 
radius such that the mean enclosed surface density equals 
the critical surface density, 

R(<fe.) = -L ( K(9)d0 = l, (1) 
™e J\9\<e E 

adopting the average of the locations of G2 and Gla 
(R.A. = 157.303936, Decl -26.392473) as the centre. The 
Einstein radius of the best-fit model is 6*e = 15'.' 2 for the 
quasar redshift of z s = 2.197, which is consistent with the 
extent of the main critical curve shown in Figure[2] However, 
we note that the choice of the centre is somewhat arbitrary. 
If wc move the centre for calculating the Einstein radius by 
±1" along the North-South and East-West directions, the 
value changes in the range 14'.'8 < 8e < 15" 4. 

We explore the errors on the model parameters em- 
ploying a Markov Chain Monte Carlo technique. We adopt 
Metropolis-Hastings sampling with a multivariate-Gaussian 
for the proposal distribution. From this procedure we 
estimate the possible range of redshifts for the multi- 
ply imaged galaxies, which we summarize in Table [T] 
We also estimate errors on key model parameters of the 
best-fit strong lens mass model, which are summarized 
in Table [5] For comparison, we also derive photometric 
redshifts of the multiply imaged galaxies based on the 
three-band HST photometry using the public code EAZY 
l|Brammer. van Dokkum. fc Coppill2008l ). and find that the 
photometric redshifts are in good agreement with the model 
predictions for ID 1, 2, and 5. The photometric redshifts 
are lower than the model predictions for ID 2 and 3, al- 
though the upper bounds on their photometric redshifts of 
z p < 2.6 (2a) marginally overlap with the expected redshift 
ranges from strong lens modelling. In addition, we find that 
our strong lens modelling constrains the Einstein radius to 
the 2a range of 14'.'5 < #e < 15'.'5. Considering the addi- 



© RAS, MNRAS 000,[THT3] 



6 M. Oguri et al. 



Table 2. Best-fit values of key model parameters from the strong lens models. The position angle 8 e is measured East of North. Aside 
from the galaxy near image 3.4-3.6, the parameters of the other member galaxies are scaled from those for G2 as a cc L 1 / 4 for the 
velocity dispersion and r cu t oc L 1 / 2 for the cutoff radius. The uncertainties are 2a confidence limits from the Markov Chain Monte Carlo 
analysis. 



component 


M vir 

(loWfi-'Me) 




a 

(kms" 1 ) 


r cu t 
(arcsec) 


e 


8e 

(deg) 


dark halo at Gl 
dark halo at G2 
galaxy G2 
galaxy near 3.4-3.6 


1 14+5.35 
1,1 -2.20 

3 04+ 1 ' 79 


2 94+ 7 - 29 
z - 3 -0.78 


291±£ 
"IIS 


13 4 +5.7 
> 1.4 


74+ - 07 
U -' 4 -0.15 

U ** J -— 0.33 
(ee 0.252) 

(ee 0.313) 


106.3^ 
10B.7±|;g 
(ee -57.6) 
(ee -167.9) 



tional uncertainties on the cluster centre mentioned above, 
we adopt #e = 15'.'2±0'.'5 (la error) for the combined lensing 
analysis in Section [5] 



4 WEAK LENSING ANALYSIS 

4.1 HST weak lensing measurement 

We carry out the weak lensing analysi s using the HST 
ACS/ F814W image. We closely follow ISchrabback et all 
for the shear measurement procedure, which is 
based on the object ca talogue generated by SExtractor 
llBertin fc Arnoutall 996) and the weak lensing method out- 
lined in Kaiser et al.l (1993). ln particular, we model tempo- 
ral point spread function (PSF) variations based on a prin- 
cipal component analysis of dense stellar fields. To correct 
for the systematic shear underestimates of the KSB method 
due to noise bias, we include an empirical calibration bias 
as a function of the signal-to-noise ratio of each galaxy. The 
method was tested against HST ACS-like simulation data 
and was shown to be accurate t o better than 2% over the 
entire magnitude range used (see[Schrabback ct al. 201(3, for 
details). 

A possible systematic effect in cluster weak lensing 
analysis is a dilut ion of the weak lensing signals by mem- 



0.1 



r [rr'Mpc] 
0.2 



0.4 



ber galaxies (e.g.. iBroadhurst et al.l [20051 ; iMedezinski et al.l 
120071 . 120101 1. In addition to member galaxies in the red- 
sequence (see Figure [TJ, the galaxy catalogue may contain 
cluster m ember galaxies up to ~ 0.5 mag bluer than the red 
sequence (Mcdczinski ct al. 2007). Hence, in order to reduce 
the dilution, we apply a colour cut of 



W.P475W — mF8i4w < — 0.15(rrtF8i4w — 22) + 2.2, 



(2) 



for our weak lensing analysis, where the magnitudes are the 
1" diameter aperture magnitudes shown in Figure[T] In addi- 
tion, we apply a total magnitude cut of rriF8i4w < 27 where 
the number counts of the galaxies in the data turns over. 
The final source galaxy number density of ~ 75 arcmin -2 
after applying these selection limits is still large enough to 
allow high-resolution weak lensing measurements near the 
cluster core. 

We estimate the mean depth (Di a /D oa ) of our source 
galaxy sample using the redshi ft distribution of galaxies as a 
function of rriF&uw derived bv lSchrabback et al.l (2010). Al- 
though the colour cut (Equation [2]) can in principle modify 
the redshift distribution, we here ignore this effect because 
a majority of galaxies outside the red-sequence are not re- 
moved by this colour cut (see Figure [T]) and therefore the 
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Figure 5. Tangential shear profiles from the weak lensing analysis 
of the HST ACS/F814W image. Filled squares show the shear 
profile after the colour cut to remove cluster member galaxies 
(equation [2}, while open squares are the shear profile without the 
colour cut. The lower panel shows the radial profile of the 45° 
rotated component. 



effect of the colour cut on the redshift distribution is ex- 
pected to be small. As a cross check, we also estimate the 
mean depth using a phot ometric redshift catalogue in the 
Hubble Ultra Deep Field (|Coe et al.ll2006| ). After applying 
the same colour (Equation [5J) and magnitude cuts, we ob- 
tain a mean depth that is quite consistent with the earlier 
estimate. We then estimate the effective source redshift at 
which the distance ratio becomes equal to the mean depth. 
We find an effective source redshift of 8 s , e ff = 1.06, which 
we adopt for our HST weak lensing analysis throughout the 
paper. 



4.2 Result 

Figure [5] shows the tangential shear profile from the HST 
ACS/F814W source galaxy catalogue. Here we again adopt 
the average of the locations of G2 and Gla as the clus- 
ter centre, although we note that the tangential shear pro- 
file is not very different even if we adopt the location of 
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Figure 6. Similar to Figure [4] but comparing the surface 
mass density profile from the weak lensing analysis of the 
HST ACS/F814W image (thin blue) to the Chandra X-ray sur- 
face brightness profile (thick red). The mass map is Gaussian- 
smoothed with a = 8". 



Gla or G2 as a cluster centre. The tangential shear detec- 
tions are statistically significant, with increasing shear to- 
ward the cluster centre. To check the impact of dilution by 
cluster member galaxies, we derived a tangential shear pro- 
file from the source galaxy catalogue without the colour cut 
defined in equation ((2)1 . The resulting shear profile, which is 
also shown in Figure [5] has a significantly weaker amplitude 
compared with the profile including the colour cut, and the 
effect is stronger near the cluster centre. This confirms ear- 
lier claims that proper selection of galaxies to remove cluster 
members is essential for accurate weak lensing analysis. We 
also check the radial number density profile of the source 
galaxies and find that the number counts after the colour 
cut decreases toward the cluster ce ntre, as expected from 
the le nsing magnification effect fe.g.. lUmetsu fc Broadhursj 
2008) and masking by cluster member galaxies. 

Next we explore the two-dimensional mass map recon- 
structed from weak lens ing, using the inversion technique of 
iKaiser fc Squires! (|l99ot ). As shown in Figure [6] there is a 
significant weak lensing mass peak around galaxy Gla and 
G2, which is consistent with the strong lens modelling re- 
sult shown in Section [3] For more quantitative comparisons 
with the Chandra X-ray surface brightness profile, we fit 
the binned, unsmoothed two-dimensional shear map with 
an elliptic al NFW profile, fo llowi ng the methodolog y de- 
veloped in lQguri et all (|2010l l and lOguri etHI (|2012h . The 
model involves 6 free parameters: the virial mass and con- 
centration parameter, the ellipticity and position angle, and 
the centroid. We find that the HST weak lensing data con- 
strain the centroid of the surface mass density profile to 



(Aa;, Ay) = (1.2_ 7 ' 4 , 1.7I g ; 9 ), in arcseconds from the aver- 
age of the locations of Gla and G2, where positive directions 
of Aa; and Ay correspond to West and North, respectively. 
Note that (Aa;, Ay)=(q=5.975, ±0.465) correspond to the 
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Figure 7. Constraints on the virial mass M v - Ir and the concen- 
tration parameter c v - 1T from the HST strong lensing (SL) anal- 
ysis, HST weak lensing (WL), and Subaru weak lensing. Here 
the strong lensing constraint comes from the constraint on the 
Einstein radius, E = 15'.'2 ± 0"5 for z s = 2.197. The weak lens- 
ing constraints are derived from azimuthally averaged tangential 
shear profiles (see Figure [8}. For each constraint, we show lcr 
and 2cr ranges in this parameter space. The innermost contours 
show the combined constraints. Dotted lines indicate theoretically 
predicted and observed average mass-conce ntration rela t ions f or 
strong lensing clusters at z ~ 0.5 derived in lQguri et al ] lj2012T ). 



Table 3. Best-fit mass and concentration parameters from com- 
bined lensing analysis (see also Figure 01. Errors correspond to 
lcr confidence level. 



constraints 


M vir 

(io 14 ^- 1 ^) 


Cvir 


HST-WL 
Subaru- WL 
.ffiST+Subaru-WL 
aST-WL+SL 
ffST+Subaru-WL+SL 


i o.r+0.89 

1 Jd -0 .28 

2 00+ - 70 
1 74+0.55 

-, o 9 +0.63 
1 " 3Z -0.14 
-1 ccr+0.40 
i - oo -0.35 


or. q + 8.9 

11 5+ 14 ' 2 
il -°-5.0 
17Q +19.3 
- L '- u -6.5 

34.7^ 
25.71^ 



locations of Gla and G2. Thus, the two-dimensional weak 
lensing analysis indicates that the mass centroid is quite con- 
sistent with the average location of Gla and G2, but it is 
also consistent with the location of Gla or G2. In addition, 
we find that the shear map prefers an elongated mass dis- 
tribution, with ellipticity e = 0.72^Q2 7 and position angle 
9 e = 93^jj deg. The position angle is in good agreement 
with both the position angle from strong lens modelling and 
that of the X-ray surface brightness profile. 
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5 COMBINED LENSING ANALYSIS 
5.1 Radial density profile 

We now combine the constraints from strong lensing (Sec- 
tion [3} , HST^jweak lending (Section [4]), and Subaru weak 
lensing l|Oguri et al.ll2012h to derive constraints on the over- 
all radial density profile of the lensing cluster. For the 
strong lensing result, we conservatively use the constraint 
on the Einstein radius of #e = 15" 2 ± 0'.'5 for the source 
redshift of z s = 2.197. For the HST and Subaru weak 
lensing data, we use the tangential shear profiles as con- 
straints. We use these data to constrain the virial mass 



r [h"'Mpc] 



Mvir and the concentration parameter 



assuming the 



NFW profile. The fit was performed for the mass range of 
10 13 /i _1 M Q < Mvir < W 16 h~ 1 M Q and the concentration 
parameter range of 0.01 < c v i r < 39.8. 

We show constraints in the M v ir-c v ir plane in Figure 
and summarize the resulting constraints on M V [ T and c v ir in 
Table O The individual constraints are quite degenerate in 
this plane, but along slightly different directions so that we 
obtain a tighter constraint when combining the three con- 



straints. We find M v 



1.55 



+0.40 
-0.35 



x 10 14 ft -1 M ra and c v 



25.7+ 



from the combined analysis. The large concentra- 



tion parameter value for the virial mass of ~ W h M@ is 
in line with a re cent study the M v i r -c v i r relation for strong 
lensing clusters <|Oguri et alj|2012h . which can be ex plained 
by th e effect of b aryon cooling and central galaxies ( jFedelil 
l2012l , but see also lDuffv et alfooid : iMcCarthy et aT1l2010h . 
However, the high concentration for this cluster should be 
interpreted cautiously given the complex core structure with 
two density peaks. We compare the tangential shear profiles 
with the best-fit model prediction in Figure [8] The three 
observational constraints are complementary and they are 
consistent with each other where they overlap. The best-fit 
NFW profile reproduces the observations for a wide range 
in radii. 

Here we discuss several systematic effects that can affect 
our results. One is the redshift distribution of source galaxies 
for weak lensing analysis. Since the area of the field used for 
our weak lensing analysis is small, the redshift distribution 
in this field can be different from that of a mean distribution. 
Another effect is a multiplicative shear measurement bias, 
which is estimated to be < 5% for our Subaru weak lens- 
ing measurements. Also the HST weak lensing measurement 
probes the region g > 0.1, where weak lensing measurements 
are less tested and therefore less reliable. To estimate a pos- 
sible impact of these systematic errors, we consider an ex- 
treme situation where both the HST and Subaru weak lens- 
ing measurements are offset by ±10%, and find the resulting 
shifts of the best-fit virial mass to be ~ ±0.3 x lO 14 k _1 M . 
The systematic error for this case is still comparable to the 
la statistical error, implying that these systematic errors 
are not significant. 



5.2 Comparison with X-ray mass 

lOta et all (|2012l ') presented the Chandra X-ray analysis of 
SDSS J1029±2623, and derived a mass profile assuming hy- 
drostatic equilibrium and is othermality . Figur e compares 
the X-ray mass profile from lOta et all (|2012h with the re- 
sult of the combined lensing analysis in Section [571] We find 




9 [arcmin] 



Figure 8. The tangential shear profile of the best-fit NFW model 
(solid line) as compared to the HST (filled squares) and Subaru 
(open circles) shear profiles and the reduced tangential shear 
predicted by the best-fit strong lens model (filled triangles) for 
z s = 1.1. The best-fit shear profile is computed for z s = 1.1, 
which is close to the effective source redshifts of z s = 1.06 and 
1.19 for the HST and Subaru weak lensing data. The shaded 
region indicates the \o range. The lower panel shows the radial 
profile of the 45° rotated component g x ■ 



that the mass profiles derived from lensing and X-ray dif- 
fer significantly with each other. While the enclosed masses 
agree at the radius of ~ 100/i _1 kpc that roughly corre- 
sponds to the Einstein radius of this system, the enclosed 
masses inferred from the X-ray data are a factor of ~ 2 
larger than those inferred from the combined lensing anal- 
ysis at radii r > r-2500, where r25oo is defined by the radius 
within which the average density is 2500 times the critical 
density at the cluster redshift. We note that the lensing de- 
rived mass implies an X-ray temperature and luminosity of 
T ~ 2 — 3 keV and Lx ~ 10 44 erg s -1 based on X -ray scaling 



relations (e.g.. iDai. Kochanek. fc Morgan! 120071 ). which are 
significantly lower than the observed X-ray temperat ure of 
T ~ 8.1 keV and L x ~ 10 45 erg s _1 (|Ota et alj|20l3 ). It is 
also worth noting that recent systematic weak lensing anal- 
ysis of clusters at z < 1 found somewhat s maller weak lens - 
ing masses for a given X-ray temperature (| Jee et al.ll201ll ). 
which is qualitatively similar to our result, although the dif- 
ference appears to be much smaller than found here. 

There are several effects that can lead to a system- 
atic difference between the X-ray and lensing masses. One 
of the most significant effects for X-ray cluster mass mea- 
surements is any violation of the assumption of hydrostatic 
equilibrium due the presence of non-thermal pressure sup- 
port. However, this effect typically leads to an underesti- 
mate of the X-ray mass, particu l arly in the outski rts of 
clusters (e.g.. iMahdavi et all 120081 ; IZhang et alj|2010h . and 
would only make the discrepancy larger. Another possibility 
is our assumption of isothermality. In fact, X-ray tempera- 
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Figure 9. The enclosed mass within a sphere of radius r from the 
Chandra X-ray analysis and from the combined lensing analysis 
(solid line; see Section 15,111 , assuming a spherically symmetric 
mass distribution. For the Chandra X-ra y analysis, we sho w the 
results for the isothermal /3-model from lOta et al 1 ll20ll open 



circle ) and a /3-model with the temperature profile of lBurns et al.l 
l|20ld . open squares). The errors in the X-ray masses come from 
the statistical errors in the X-ray temperature measurement. For 
reference, we also show the cluster radii r v j r , rsoo, and r2500i 
which are computed from the best-fit lensing mass profile, as well 
as the Einstein radius te- 



tures generally decrease at large radii, but our Chandra data 
were not sensitive enough to detect such a decrease. To ex- 
amine this effect, we adopt a te mperature profile o btained 
from hydrodynamic simulations ([Burns et al.ll2010h . which 
appears to be consiste nt with recent X-ray o bservations of 
cluster outskirts (e.g.. [Xkamatsu et al.l [201ll ) . and recalcu- 
late the X-ray mass profile. While this reduces the difference 
at the center, it cannot explain the overall difference between 
the X-ray and lensing masses. It is also difficult to explain 
the difference by a triaxial halo, because the large c v i r im- 
plies that the major-axis of the cluster is more likely to be 
aligned with the line-of-sight directio n, in which case the 
lensing mass should be overestimated (|Oguri et al.l l2005). 

The most likely explanation for the mass discrepancy 
is shock heating of the intracluster gas during a merger. 
Numerical simulations show significant boosts of X-ray lu- 
minosity and temperature ~ 1 Gyr after mergers, which 
can lead to signif i cant o v erestimates of X-ray masses (e.g.. 



Ricker & Sarazin 



20ld ; iRasia et al.l 



2001; Takizawa, Nagino, & Matsushita 
201 ll ; iNelson et alj l2012h . Observation- 



ally, there are several clusters showing signs of ongo- 
ing mergers that also have sig nificantly higher X-ra 
masses than lensing masse s (e.g ., lOkabe fc Umetsull200 
lOkabe et all l201ll ; ISoucaill I2012T ). Indeed the lensing clus 
ter of SDSS J0129+2623 shows hints of an ongoing merger, 
including the bimodal nature of cluster cores, the complex 
X-ray morphology, a possible offset between mass and X-ray 
centroids, and the large G1-G2 velocity difference. In addi- 
tion, a line-of-sight merger can naturally e xplain the high 
concentration estimates for this cluster (e.g. jKing &: Corlesa 



Spectroscopy of many more cluster member galax- 
ies is needed to understand this complex cluster further. If 
this interpretation is correct, the agreement between X-ray 
and lensing masses near the Einstein radius might just be a 
coincidence, in that both the masses are overestimated by 
merger shock heating and halo elongation along the line-of- 
sight, respectively. 



5.3 Gas-to-mass ratio 

Another useful cross-check of our interpretation is provided 
by the gas-to-mass ratio, / gas (< r) = A/ gas (< r)/M to t(< 
r), because it is expected to roughl y match the cosmic 
baryon fraction SI^/SIm — 0.167 l|Komatsu et al.l 1201 ll ) 
for massive clusters, and also because M gas appears to be 
the most promisin g cluster mass proxy (Oka be et ai]|2010l : 
Fabi an et alj|201lf ). We use t he gas mas s profi le implied by 
the isothermal /3-model from lOta et all (|2012T ) to estimate 
a gas mass within the radius rsoo = 0.49/i _1 Mpc from the 
combined lensing analysis. We find a gas mass fraction of 
/gaa(< rsoo) = 0.060 ± 0.014, which is significantly smaller 
than the cosmic baryon fraction. The gas mass fraction is 
slightly increased to / gas ( < r 5 oo) = . 077 ± 0.017 when 
the temperature profile of iBurns et all (|2010h is used to 
derive the total mass, but the value is still low. On the 
other hand, when the lensing mass estimate is used for 
the total mass, we obtain a gas mass fraction of / gaa (< 
rsoo) = 0.158 ± 0.038, which is in b etter agreement with 



,f ga a estimat es in other clusters (e.g., IVikhlinin et all 



Allen et all 120081 ; lEttori et all l2009l;lumetsu et all 



2006; 



2009; 



Dai et al.ll2010bl ; lokabe et alj|2010l : iPratt et alj|20ich . This 



further supports our interpretation that the X-ray derived 
mass is significantly overestimated due to shock heating of 
the intracluster gas during a merger. 



6 IMPLICATIONS FOR THE FLUX RATIO 
ANOMALY 

6.1 Searching for substructures 

Our str ong lens models i n Section E3 confirm the earlier 
claim s (|Oguri et all 120081 ; iKratzer et all l201ll ; lOta et all 
|2_012j) that a substructure is required to explain the anoma- 
lous flux ratios between the three quasar images. In order 
to search for possible substructures, we subtract the quasars 
from each HST image and carefully examine the residual im- 
ages. In the ACS/F814W image, we find a probable galaxy 
in the vicinity of image C, as shown in Figure 1101 We fit 
the galaxy, named GX, assuming a S ersic profile with index 
n = 4 using galfit (|Peng et alll2002l ). and find the centroid 
of the galaxy to be ~ 0'.'4 from image C (R.A. =157.309482, 
Decl.=26. 394158), and that the magnitude of the galaxy is 
W-F814W ~ 24.3. These estimates have large uncertainties 
due to the significant residuals from the subtraction of im- 
age C (see Figure [TU}. In the strong lens models we estimate 
that galaxy G2 with mFsuw = 19.62 has a velocity disper- 
sion of a = 291 km s - , which implies a velocity dispersion 
of a ~ 100 km s" 1 for GX assuming a oc L ' . The inferred 
velocity dispersion of GX is large enough to mean that it 
will significantly perturb the flux of image C. 
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Figure 10. Top: The HST ACS /F814W image around the merg- 
ing quasar images B and C. Bottom: The image after subtracting 
quasar images. The probable galaxy GX is marked by a circle. 
We used a nearby star as a PSF template. 



6.2 Mass modelling 

We next explore the impact of GX on the strong lens mod- 
els. Here we assume the same mass model as used in Sec- 
tion but include an additional component modelled by 
pseudo-Jaffe model at the location of GX with the veloc- 
ity dispersion and the cutoff radius as free parameters. We 
find that the best-fit model has \ 2 = 7.5 including the flux 
ratios, which is significantly smaller than the x 2 of the orig- 
inal best-fit model (x 2 = 30) and the large improvement is 
due to improvem ents in fitting t h e flux ratios. This confirms 
the argument by iKratzer et al] (|201lh that a substructure 
in the vicinity of image C can explain the anomalous flux 
ratio. The velocity dispersion of galaxy GX in the best-fit 
model is a — 76kms~ 1 , which is broadly consistent with the 
velocity dispersion expected from the magnitude. We sum- 
marize the flux ratios of the best-fit mass models, as well as 
the observed flux ratios at various wavelengths, in Figure [TT1 
The best-fit models with and without galaxy GX have 
quite different total magnification factors. For the model 
without galaxy GX, the magnifications for the quasar images 
are [ia = 6.8, [ib = 82.9, and \ic = 94.7, whereas for the 
model with galaxy GX, they are [ia = 10.4, /j,b = 13.8, and 
[ic = 5.5. The total magnifications are fj, to t = 184.4 for the 
model without GX and /i t ot = 29.7 for the model with GX. 
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Figure 11. Flux ratios of the best-fit strong lens models with 
(solid) and without (dotted) galaxy GX are compared with the 
observed flux ratios in different wavelengths. The top panel shows 
the flux ratio between images A and B, while the bottom panel 
shows the flux ratio between images B and C. Observed flux ra- 
tios are obtained fr om the VLA radio image (open circle) by 
IKratzer et al. the HST images (filled squares) presented 

in this paper, gro und-based optical images (filled triangles) by 
Ogu ri et all l l200Sf) . and the Chandra X -ray image with (black 
open square) and without (gre y thin open square) the absorption 
correction bv lOta et al. (2012). 



Substructures near merging image pairs can have 
large effects on time delays be tween the images l|Oguril 
120071 ; iKeeton fc Moustak as 2009). T his wa s indeed the case 
for SDSS J1004+4112 (|lnada et all 120031 ), another large- 
separation lensed quasar produced by a massive cluster of 
galaxies, in which a time delay between the smallest im- 
age pair was found to be affected by the perturbation from 
nearby cluster member galaxies (|Fohlmeister et al1l2007l ). 
We find that our best-fit mass models predict the time de- 
lay between image B and C as Atsc = 0.7 day for the model 
without GX, and Atsc = 7.7 day for the model with GX. 
In both cases, image B leads image C as expected from the 
image geometry. On the other hand, GX has a negligible 
impact on the AB time delay. The large difference of Atsc 
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implies that measuring the BC time delay will greatly help 
constrain the substructure that affects these quasar images. 



6.3 Morphology of the lensed host galaxy 

An interesting feature of SDSS J1029+2623 is the presence 
of a highly elongated host galaxy, which is most prominent 
in the HST WFC3/F160W image. Here we examine whether 
our best-fit models reproduce the morphology of the lensed 
host galaxy. To do so, we include the host galaxy modelled 
by a Sersic profile, and optimise the parameters (the total 
flux, size, ellipticity and position angle) of the Sersic pro- 
file assuming the best-fit mass models, both with and with- 
out galaxy GX. We fix the centroid of the host galaxy in 
the source plane to the best-fit source quasar position. The 
optimisation is conducted on a pixel-by-pixel basis using a 
binned HST WFC3/F160W image with a pixel scale of 0'.'2. 
The model-predicted images are convolved with a PSF as- 
suming a Moffat profile with a FWHM of 0'.'18 and B = 2, 
before comparing with the observed image. 

The result is shown in Figure [T2j We find that our 
best-fit models reproduce the morphology of the lensed host 
galaxy quite well. The match is better for the model with 
galaxy GX, which further supports its presence and the sig- 
nificant impact of galaxy GX on this lens system. In partic- 
ular, we note that the observed host galaxy shows signs of a 
kink near image C (see also Figure [TO)) which is reproduced 
in our best-fit model with galaxy GX. We estimate the total 
magnitude of the lensed host galaxy as rriFiGow ~ 18.5 and 
the lensing-corrected original magnitude as mFi6ow ~ 22.4. 
The best-fit effective radius and Sersic index are r e = 0'.'27 
and n = 0.88, respectively. The presence of galaxy GX has 
a negligible impact on the total magnification of the host 
galaxy. The highly magnified host galaxy provides a very 
unique opportunity to study the relation between supermas- 
sive black holes and host galaxies in a distant quasar (e.g., 
iPeng et alJl200d : lRoss et alj|2009h . 



7 CONCLUSION 

We have presented a detailed gravitational lensing analysis 
of SDSS J1029+2623, the largest-separation lensed quasar 
system known (image separation 9 = 22'.'5, lens redshift 
zi = 0.584, and source redshift z a — 2.197), based on new 
HST ACS and WFC3 observations. With the help of the 
high-resolution HST images, we have identified 6 new mul- 
tiply imaged galaxies in addition to the three-image lensed 
quasar. Based on the newly identified multiple images, we 
construct an accurate mass model at the cluster core. We 
find dark haloes with roughly similar masses should be as- 
sociated with both of the bright central galaxies. The strong 
lens models allow us to estimate the Einstein radius of the 
overall lens as #e = 15" 2 ± 0'.'5. Our best-fit strong lens 
model successfully reproduces the observed time delay be- 
tw een quasar image A and B which was recently measured 
by iFohlmeister et af (2012). We have also detected a sig- 
nificant weak lensing signal in the HST ACS image. The 
mass map obtained from the strong and weak lensing analy- 
sis app ears to be differe nt from the X-ray surface brightness 
profile (|Ota et al.ll2012h . suggesting an ongoing merger. 



We have constrained the radial mass profile of the clus- 
ter from the HST strong and w eak lensing analys is as well as 
the Subaru weak lensing result (|Oguri et alj 2012). The com- 
bination of the three lensing constraints enabled a robust de- 
termination of the radial mass profile, with a best-fit virial 
mass of Mvir = 1.5512'q? x 10 14 /i -1 Mq and a concentration 



parameter of c v i r = 25. 7_ 



The lensing derived mass pro- 



file differs significantly from the mass profile inferred from 
the Chandra X-ray data assuming hydrostatic equilibrium 
and isothermality, such that the X-ray mass is a factor of ~ 2 
larger at outer radii of r2soo ^ r < rsoo. We ascribe the mass 
discrepancy to shock heating of the intracluster gas during 
a merger, which leads to significant overestimates of X-ray 
cluster masses. Our interpretation is supported by the small 
cluster gas mass fraction implied by the X-ray estimates of 
total masses, with / gas (< r^oo) = 0.060±0.014, as compared 
to / g as(< ?"50o) = 0.158 ±0.038 using the lensing estimate of 
the total mass. A merger along the line-of-sight can also ex- 
plain the high concentration parameter value found for the 
cluster, and is consistent with the large velocity difference 
of Aw ~ 28 00 kms" 1 between the bright central galaxies 
Gl and G2 |Oguri et alj|200sl ). The merger interpretation 
for this strong lens selected cluster, in turn, may imply a 
large imp act of cluster mergers on strong lensing cross sec- 
tions (e.g-. lTorri et al.ll2004lFedeli et al.ll2006l ; lRedlich et all 
l2012h . 

If the merger interpretation is correct, one question is 
whether the large velocity difference is compatible with the 
theoretical expectations. There are several works examin- 
ing the distribut ion of velocity di f ferenc e s between close 



halo pairs (e.g.. lHavashi fc White] 120061 ; iLee fe Komatsul 



to pa 

|2010| ; [Thompson fc Nagamind 12013) . which indicate that 
a relative velocity of Av ~ 2800 kms -1 for haloes with 
Mvir > 10 14 /i _1 M Q is quite rare. Therefore, it is important 
to make more quantitative comparisons taking account of 
the redshift and mass of SDSS J1029+2623 in order to check 
whether this large velocity difference can be explained in the 
framework of the standard A-dominated cold dark matter 

cosmology. 

We confirm the earlier claim s l|Oguri et al . 2008; 
iKratzer et"ai1l201ll ; lota et alj|2012l ) that the observed flux 
ratios between the quasar images cannot be reproduced by 
strong lens models without any substructure in the vicinity 
of the quasar images. From a careful inspection of the HST 
images, we have identified a probable galaxy GX which is 
only ~ 0'.'4 from quasar image C. A mass model including 
GX successfully reproduces the observed flux ratios, thereby 
resolving the flux ratio anomaly. While GX has little effec t 
on the AB time delay measured by Fohlmcist er et all l|2012h , 
the BC time delay is a sensitive probe of its effects. We have 
also checked the morphology of the lensed host galaxy and 
found a very good agreement with the model prediction. The 
best-fit model predict the total magnification of the quasar 
as fttot = 30 and that of the lensed quasar host galaxy as 
^tot — 35. 

Finally we note that additional observations are essen- 
tial to better characterise this unique quasar-cluster lens 
system. For instance, the measurement of the time delay 
between quasar images B and C will be important for con- 
straining the properties of galaxy GX. The merger interpre- 
tation for the lensing cluster should be checked by extensive 
spectroscopy of cluster member galaxies. Spectroscopy of the 
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Figure 12. First panel: The i75T WFC3/F160W image showing the lensed host galaxy of the quasar at z s = 2.197. All the quasar 
components are subtracted using a nearby star as a PSF template. Second panel: The morphology of the lensed host galaxy in the best-fit 
mass model without galaxy GX from Section [3] The locations of the three quasar images are marked by circles. Third panel: Similar to 
the middle panel, but the case for the best-fit model with galaxy GX. Fourth panel: The HST WFC3/F160W image after subtracting 
the best-fit host galaxy model without galaxy GX. Fifth panel: The HST WFC3/F160W image after subtracting the best-fit host galaxy 
model with galaxy GX. Note that all the panels are presented with the same grey-scale stretch. 



multiply imaged galaxies identified in this paper is impor- 
tant both for more robust mass models and fo r possible cos- 
mological constraints (e.g.. Ijullo et alJl201Ch . Furthermore, 
the highly elongated host galaxy makes this lens system an 
ideal site to conduct detailed explorat ions of a quasar host 
galaxy at z ~ 2 fe.g.. lRoss et al.1 12009'). 
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